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We present a family of algorithms, combining real-space renormalization methods and belief 
propagation, to estimate the free energy of a topologically ordered system in the presence of defects. 
Such an algorithm is needed to preserve the quantum information stored in the ground space of 
a topologically ordered system and to decode topological error-correcting codes. For a system 
of linear size £, our algorithm runs in time log^ compared to £ 6 needed for the minimum- weight 
perfect matching algorithm previously used in this context and achieves a higher depolarizing error 
threshold. 



Topologically ordered phases of matter can be used to 
store and process quantum information in an inherently 
robust way [TH5]. The ground state degeneracy depends 
on the topology of the system. Quantum information 
stored in this ground state manifold is protected from 
local perturbations because virtual transitions require an 
order in perturbation proportional to the linear size of the 
system. 

At finite temperature, thermal excitations create pairs 
(or larger sets) of particles of finite mass. These parti- 
cles are not confined and can freely diffuse on the sur- 
face. Bringing all the particles to fuse in pairs returns 
the system into a ground state. This ground state will 
correspond to the original one only if the world-line of 
the particles has a trivial homology: otherwise the parti- 
cles have generated a topological transformation that cor- 
rupts the information stored in the ground space. Since 

mass 

particles appear in a finite density (e k s T ) at any non- 
zero temperature, this information corruption happens 
on a time-scale independent of the system size [6 . 

To store the information for longer times, it becomes 
necessary to keep track of the thermal particles and have 
accurate knowledge of their world-line homology. This 
is possible if the locations of the particles are measured 
repeatedly on a time-scale shorter or comparable to their 
diffusion rate. It is also necessary to process the infor- 
mation gathered from these measurements rapidly, i.e., 
to infer the world-line homology from knowledge of the 
particle configuration at discrete times. 

For some models, such as color codes [7], there is no 
known efficient algorithm that can infer the particles' 
world-line homology. Other models, such as Kitaev's 
code, admit an efficient algorithm — the minimum- weight 
perfect matching algorithm of Edmonds [8 j (PMA) — that 
solves this problem in time proportional to £ 6 where i is 
the linear size of the system [HE]. With such a scaling, 
it is not possible to handle lattices of more than a few 
hundred sites long in any reasonable time, ruling out any 
practical application in this context. Additionally, this 
algorithm is sub-optimal. As we show below, decoding 
a topological code reduces to minimizing the free-energy 
over all homology classes of the system. The PMA min- 
imizes the energy instead, which can lead to additional 



corruption of the information. 

In this Letter, we present a real-space renormaliza- 
tion group (RG) algorithm to accomplish this task in 
time log^ (based on a parallel architecture, or £ 2 \og£ 
in series). The algorithm is not exact and makes use 
of mean-field equations implemented as a belief propaga- 
tion algorithm (see [10]). Despite these approximations, 
our algorithm achieves a successful decoding of Kitaev's 
model up to a tolerable noise threshold (diffusion rate) 
that exceeds the one obtained by the PMA. Our algo- 
rithm is suitable for a large class of topologically ordered 
systems including color codes and can be adapted to sur- 
faces of arbitrary geometries and topologies. 

Our algorithm is also of use for fault-tolerant quantum 
information processing schemes based on topological er- 
ror correcting codes g| [TTHT3] . This method achieves 
a high (0.75%) fault-tolerant quantum computing error 
threshold by making use of some of the ideas outlined 
above to encode and manipulate the quantum informa- 
tion in a planar architecture, but do not require a topo- 
logically ordered phase of matter. In this setting, our al- 
gorithm becomes an efficient decoder, inferring the most 
likely logical recovery given the error syndrome. 

Finally, our algorithm should be of interest in con- 
densed matter more generally because it provides a nu- 
merical method to study topological to quenched disorder 
phase transitions. We note that real-space RG methods 
have been devised for systems with topological order [14] , 
but to our knowledge they do not apply in the presence 
of disorder. 

Toric Code — In the following, we focus on Kitaev's toric 
code. It consists of a i x i two-dimensional square lattice 
embedded on a torus. Qubits are located on the edges 
of the lattice. For each site s, define an operator A s and 
for each plaquette p (site of the dual lattice), define an 
operator B p by 

A s = (g) X q , B p = 0Z 9 (1) 

<?Gn(s) q£n(p) 

where n(s) consists of the 4 neighboring qubits of site s, 
n(p) of the 4 neighboring qubits of plaquette p and X 
and Z are Pauli matrices. We refer to these operators 
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collectively as stabilizer generators. 

The toric codespace is the ground state manifold of the 
Hamiltonian: 

« = -5>.-J> P ( 2 ) 

s p 

Observe that the stabilizer generators all commute with 
one another. Thus, the ground space of H is their si- 
multaneous +1 eigenspace. This implies that the stabi- 
lizer group, formed of arbitrary products of its generators 
{A S ,B P }, acts trivially on the codespace 

0|^gs) = |^gs> Oe({A s ,B p }). (3) 

Because they consist of unit loops, one can view the 
Bp (A s ) as generators of the group of contractible loops 
on the (dual) lattice. Thus, any contractible loop of Z 
(X) operators on the (dual) lattice acts trivially on the 
codespace. In fact, any operator that maps the codespace 
to itself must consist of closed loops of X (or Z) operators 
because, in order to commute with all A s (B p ), they must 
intersect every site (plaquette) an even number of times. 
We conclude that only non-contractible loops can map 
the ground space to itself in a non-trivial manner. More 
specifically, for two non-contractible loops on the lattice, 
L\ winding around the hole and L 2 winding around the 
body of the torus, and two non-contractible loops on the 
dual lattice, L[ winding around the body and L' 2 winding 
around the hole, these operators are 

Z % = H Z q , X v = [] *q- ( 4 ) 

One can easily verify that these logical operators com- 
mute with the stabilizer group and obey the canoni- 
cal commutation relations [Xi,Xj] = [Zi,Zj] = and 
XiZj = {—l) 5 ^ZjXi. Thus, these four loop operators 
form a Pauli algebra of 2 effective qubits encoded in the 
topological degrees of freedom of the ground space. The 
different combinations of these operators generate the 16 
different homology classes that need to be distinguished 
in order to prevent the corruption of the information. 

Errors & Decoding — To characterize the error-correcting 
capacities of this code, we model the noise by the depo- 
larizing channel. This is a noise model where each qubit 
independently gets randomized with some probability q: 

p -»■ (l-q)p+q\ = {l-p)p+ P -{XpX+YpY+ZpZ) (5) 

where p = As seen from the r.h.s. of Eq. (J5|, we can 
equivalently describe this noise model as acting trivially 
on the qubit with probability 1 —p, or otherwise randomly 
applying one of the three Pauli operators. 

Because it anti-commutes with its two neighboring pla- 
quette operators B pi an X error will add two units of en- 
ergy to the system. We say that it causes two plaquette 



defects, which can be interpreted as particles. Similarly, 
a Z error creates a pair of site defects and a Y error 
creates a pair of both defects. In the case of multiple 
errors, the defects appear only at the endpoints of er- 
ror chains. Thus, distinct error chains with the same 
endpoints cannot be distinguished given the defect pat- 
tern they generate. If the union of two such error chains 
forms a contractible loop, their action on the codespace 
is identical, i.e. their product belongs to the group of 
contractible loops. Such errors are said to be degenerate. 
If, on the other hand, the union of two such error chains 
forms a non-contractible loop, they have a distinct effect 
on the codespace. Thus, the decoding algorithm must 
somehow decide the most likely homology class of the er- 
ror based on the knowledge of the defect pattern, also 
called the error syndrome. 

We now show how this inference problem can be 
mapped to a statistical mechanics problem [4] . Associate 
to each error chain c an energy E(c) = J\c\ + J'^defect(c). 
Here, \c\ denotes the weight of the error chain, i.e. the 
number of physical qubits on which it acts non-trivially. 
The second term ndefect(c) counts the number of mis- 
matches between defects and endpoints of c. The ratio 
J' I J encodes the level of confidence of our defect de- 
tection. Then, given the depolarizing error model, the 
probability of an error chain c given a fixed defect pat- 
tern is Pr(c) = e ) where the partition function is 

2(0) = S c e~^ E ^ and the Nishimori inverse tempera- 
ture is given by 

*-ib*I^. (6) 

J P 

In the following, we take J' — >• oo, meaning that we only 
consider error chains c in perfect agreement with the ob- 
served defect pattern. 

The decoder's task then consists in evaluating the 
probability of each of the 16 homology classes gen- 
erated from all combinations of the 4 non-contractible 
loops: 

Pr(n) = J2Pr(c)- (7) 

We can express this probability in terms of ther- 
modynamical quantities. Let Pr(c\Q) = j^rM Iq(c) 
denote the probability of an error string c condi- 
tioned on homology class where Iq is the indi- 
cator function of Q. The entropy and average en- 
ergy associated to this conditional probability are given 

by S(Q) = -^2cen Pr ( c \ n ) lnPr ( c \ n ) and E ( n ) = 
^2 ceQ Pr(c\Q)E(c) respectively. In terms of the Gibbs 
free energy (3F(Q) = (3E(Q) - 5(0), we obtain 

e -(3F(Q) 

PrW - - m -. (8) 
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Figure 1: Simple defect patterns illustrating the importance 
of the entropic term or error degeneracy (left) and correlations 
of errors (right). 

Thus, we see that the optimal decoding consists in choos- 
ing the homology class Q that minimizes the free energy. 

In those terms, we can clarify why the PMA is sub- 
optimal. By finding the shortest error chain compatible 
with the error syndrome, PMA minimizes energy rather 
than free energy. In other words, it operates at zero tem- 
perature rather than the Nishimori temperature. Figure 
[I] (left) illustrates a situation where the entropic term 
influences the decoding. Namely, two distinct homology 
classes (dash and full lines) have the same minimum en- 
ergy configuration, but one of them has higher entropy 
because it contains 4 degenerate errors, and therefore 
should be chosen for the decoding. In that case, the 
PMA has a smaller success probability than a free en- 
ergy minimization. Repeating this error pattern leads to 
situations where the difference in success probability can 
be arbitrarily large. 

Another limitation of the PMA is that it must consider 
X and Z errors independently. However, because a Y er- 
ror is the product of a X and a Z error, the two types 
of errors are highly correlated in a depolarizing model. 
Figure [l] (right) illustrates a situation where these corre- 
lations influence the decoding. Specifically, the top and 
bottom figure illustrate two possible error chains for a 
defect pattern. If the X and Z errors are treated as inde- 
pendent, then the upper configuration has lower energy. 
Taking into account the correlations of the depolarizing 
channel on the other hand favors the lower configuration. 
Again, more elaborate examples lead to situations with 
highly different success probabilities of the two methods. 

New Approach — The main idea behind the approach we 
propose is to approximate the toric code by a concate- 
nated code. A concatenated code is constructed by en- 
coding, say, one qubit into a code block of n qubits 
(n should be relatively small), and then encode each of 
these qubits into n qubits, etc. Clearly, the final num- 
ber of qubits used is exponential in the number of con- 
catenations, but the failure probability decreases doubly- 
exponentially. 



Figure 2: The 12 qubits that constitute a code block in 
the concatenation approximation. They form a surface code. 
Dashed links represent qubits shared between blocks. 

Concatenated codes can be efficiently and optimally 
decoded using a recursive algorithm [15]. First, we com- 
pute the probability of each logical operator (or homology 
class for topological codes) of the codes appearing in the 
last concatenation step. As in the previous section, this 
problem maps to the evaluation of a free energy for a sys- 
tem of finite size n, so can be solved by brute force. These 
probabilities become the noise model for the second-to- 
last layer of codes, replacing the depolarizing channel. In 
turn, these code blocks can be decoded by brute force. 
This process is repeated until we reach the top layer of 
the code, providing a probability distribution for the net 
error affecting encoded qubits. 

One can view each step of this decoding process as 
a RG transformation. The noise model evolves from one 
concatenation level to the next based on some simple pro- 
cedure. The transformation is not homogenous in space 
and time, reflecting the inhomogeneity of the observed 
error syndromes. 

It is noteworthy that all the codes appearing in a 
given concatenation layer can be decoded simultaneously. 
Hence, this decoding scheme is tailored for paralleliza- 
tion and therefore has a running time proportional to 
the number of concatenation layers, i.e., scaling logarith- 
mically with the number of qubits. The time to decode 
a single block scales exponentially with n (small). 

The toric code is not truly a concatenated code but 
can be viewed as one if we allow the different code blocks 
of a given concatenation layer to share qubits, i.e., to 
overlap with one another. One possible way of breaking 
the toric code into overlapping code blocks is illustrated 
on the l.h.s. of Fig. [2] The dashed edges represent qubits 
that are shared by two neighboring code blocks. 

Each of these code blocks is a small open-boundary 
topological code, so it can be decoded by brute force. The 
outcome is a probability distribution for the two logical 
qubits encoded in this small code. This RG procedure, 
schematically illustrated at Fig.|2j is the core idea behind 
our method. It maps an error model on 8 qubits to a 
renormalized error model on 2 qubits. The procedure is 
repeated by joining 4 pairs of such renormalized qubits to 
obtain a new block of 8 qubits. At the final iteration, we 
obtain the error distribution of the two qubits encoded 
in the toric code. 

Results & Improvements — We use Monte Carlo sampling 
to characterize the decoder's performance. The basic 
RG decoder described above achieves a depolarizing error 
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Figure 3: Failure probability as a function of bit-flip probabil- 
ity p for a £ = 8, 16, 32, . . . 1024 toric code decoded using the 
2x1 block RG algorithm with 3 belief propagation rounds. 
Each data point is 10 4 trials. 



threshold p ~ 7.8% compared to p ~ 15.5% obtained us- 
ing the PMA [9j [16] . This threshold can be significantly 
improved by imposing self-consistent conditions on the 
qubits that are shared between blocks. Suppose qubit Q 
is shared between a block on the left and a block on the 
right. Let Dl and Dr denote the defect pattern on the 
left and right code block respectively. Each block will as- 
sign a conditional error probability to Q Pt(Eq\Dji) and 
Pt(Eq\Dl) that are different in general. To improve our 
decoding algorithm, we impose that all such conditional 
probabilities agree. This leads to a set of mean-field self 
consistency equations that we solve by a belief propaga- 
tion algorithm. We obtain a threshold p ~ 15.2%, quite 
comparable to the one of the PMA. 

Our algorithm is very flexible and allows for mul- 
tiple tradeoffs between noise suppression and running 
time. Note that the running time of all the variations 
we present scales like log^, but the constant pre-factors 
greatly differ. For instance, we can replace the 2x2 
code block of the RG scheme by a 2 x 1 code block. 
This slightly deteriorates the threshold to p ~ 12.9%, 
but speeds up the decoding by a factor of a few hundred. 
We can also choose to ignore the correlations between X 
and Z errors and decode them separately as bit-flip and 
phase-flip channels. This is a more direct comparison 
with the PMA. Our method yields a threshold of p ~ 9% 
with 2 x 2 RG blocks and p ~ 8.2% with 2 x 1 RG blocks 
(see Fig. |3|, compared to p ~ 10.3% for the PMA, and 
speeds up our RG algorithm by a factor of a few thou- 
sands. Even without parallelization, this enables us to 
decode lattices a thousand sites long, c.f. Fig. [3j 

We can further improve these thresholds by combining 
the RG scheme with the sparse code decoder we pre- 
sented in [17]. We first use the sparse code decoder with 
the full (correlated) error model to update the X and 
Z error probabilities on each physical qubit. We then 
use the RG scheme to decode X and Z errors indepen- 
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Figure 4: Failure probability as a function of depolarizing 
strength p for a £ = 16, 32, 64, and 128 toric code decoded 
using the 2x2 subcode RG algorithm with three rounds of 
belief propagation with prior physical decoding. 

dently. The threshold achieved is p ~ 16.4% (see Fig.[4|, 
exceeding the one of the PMA. 

Conclusion — We presented a real-space RG method to 
decode the quantum information stored in the topolog- 
ical degrees of freedom of a system in the presence of 
noise. Our method is very versatile, faster than existing 
schemes, and tolerates a higher noise rate. It is also the 
first known algorithm to decode the color code. Beyond 
its immediate implication to quantum information, it 
opens the door to new numerical methods to study topo- 
logically ordered systems with quenched disorder (see [18] 
for a similar combination of RG and belief propagation 
in the context of condensed matter physics). 
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